OPTIMAL ESTIMATION OF ENSEMBLE AVERAGES FROM A QUANTUM MEASUREMENT






Paolo Perinotti 1,2 and Giacomo M. D'Ariano 1 

1 Dipartimento di Fisica "A. Volta", 
via Bassi 6, 27100 Pavia, Italy 
2 perinotti@fisicavolta.unipv.it

We consider the general measurement scenario in which the ensemble average of 
an operator is determined via suitable data-processing of the outcomes of a quantum 
measurement described by a POVM. After reviewing the optimization of data processing
that minimizes the statistical error of the estimation, we provide a compact
formula for the evaluation of the estimation error.

1. INTRODUCTION

A measurement that can be performed in the lab is described by a POVM (acronym for
Positive Operator-Valued Measure), namely a set of (generally nonorthogonal) positive
operators $P_i \geq 0$, $1 \leq i \leq N$, which resolve the identity, $\sum_{i=1}^N P_i = I$,
similarly to the orthogonal projectors of an observable [1]. The probability distribution
of the $i$th outcome is given by the Born rule

$$p(i|\rho) = \mathrm{Tr}[P_i \rho], \tag{1}$$

$\rho$ being the density operator of the state. By such a measurement one can experimentally
determine the ensemble averages of (generally complex) operators $X$. Clearly this is
possible if $X$ can be expanded over the POVM elements (mathematically, we denote this
condition as $X \in \mathrm{Span}\{P_i\}_{i=1,N}$). This means that there exists a set of
coefficients $f_i[X]$ such that

$$X = \sum_{i=1}^N f_i[X]\, P_i, \qquad X \in \mathcal{S} := \mathrm{Span}\{P_i\}_{i=1,N}. \tag{2}$$

When $\mathcal{S} = \mathcal{B}(\mathcal{H})$ (i.e. when all operators can be expanded over the
POVM), the measurement is informationally complete. Obviously, once the expansion (2) is
established one can obtain the ensemble average of $X$ by the following averaging

$$\langle X \rangle = \sum_{i=1}^N f_i[X]\, p(i|\rho), \tag{3}$$

where the probability distribution is given in Eq. (1).
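As an illustration of Eqs. (1)-(3), the following minimal sketch uses a four-outcome qubit POVM with elements $(I \pm \sigma_z)/4$ and $(I \pm \sigma_x)/4$. This POVM, the input state, and all helper names are assumptions chosen for the example and do not appear in the original text. The sketch evaluates the Born rule and recovers $\langle \sigma_z \rangle$ from the outcome probabilities alone.

```python
# Real 2x2 matrices as nested lists (sigma_x and sigma_z are real, so no
# complex arithmetic is needed). All helper names are illustrative.
def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def trace(a):
    return a[0][0] + a[1][1]

def lincomb(coeffs, mats):
    """Return sum_k coeffs[k] * mats[k] for 2x2 matrices."""
    return [[sum(c * m[i][j] for c, m in zip(coeffs, mats)) for j in range(2)]
            for i in range(2)]

I2 = [[1.0, 0.0], [0.0, 1.0]]
SX = [[0.0, 1.0], [1.0, 0.0]]
SZ = [[1.0, 0.0], [0.0, -1.0]]

# Assumed example POVM, ordered as (I+sz)/4, (I-sz)/4, (I+sx)/4, (I-sx)/4.
povm = [lincomb([0.25, 0.25 * s], [I2, S]) for S in (SZ, SX) for s in (1, -1)]

# Assumed example state rho = (I + 0.6 sigma_x + 0.4 sigma_z) / 2.
rho = lincomb([0.5, 0.3, 0.2], [I2, SX, SZ])

# Born rule, Eq. (1): p(i|rho) = Tr[P_i rho].
probs = [trace(matmul(P, rho)) for P in povm]

# One valid set of coefficients f_i[X] for X = sigma_z in Eq. (2).
f = [2.0, -2.0, 0.0, 0.0]
X_rebuilt = lincomb(f, povm)

# Eq. (3): ensemble average obtained from the outcome statistics.
avg_from_outcomes = sum(fi * p for fi, p in zip(f, probs))
avg_direct = trace(matmul(SZ, rho))

print(probs)
print(avg_from_outcomes, avg_direct)
```

The two printed averages agree because the coefficients reproduce $\sigma_z$ exactly as a combination of the POVM elements.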

The above general measurement procedure opens the problem of finding the coefficients
$f_i[X]$ in Eq. (2), namely the data-processing of the measurement outcomes needed to
determine the ensemble average of $X$. In general the coefficients $f_i[X]$ are not unique (if
$N > \dim(\mathcal{S})$), and one then wants to optimize the data-processing according to a
practical criterion, typically minimizing the statistical error. This problem has been solved
in the general case in [2], and its solution will be reviewed in this paper. In addition, we
will present a simple formula for the minimum estimation error for an arbitrary operator $X$.
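The non-uniqueness of the coefficients, and its effect on the statistical error, can be seen in a small sketch. It assumes a four-outcome qubit POVM with elements $(I \pm \sigma_z)/4$ and $(I \pm \sigma_x)/4$ and a single input state with Bloch components $(r_x, r_z) = (0.6, 0.4)$. These choices are illustrative and not taken from the original text. Since $P_1 + P_2 - P_3 - P_4 = 0$, any multiple of $(1, 1, -1, -1)$ can be added to a valid coefficient vector for $\sigma_z$ without changing the estimated average, while the variance of the estimate does change.

```python
# Outcome probabilities p_i = (1 + n_i . r) / 4 for the assumed POVM and state.
r = (0.6, 0.4)                                       # (r_x, r_z)
directions = [(0, 1), (0, -1), (1, 0), (-1, 0)]      # (n_x, n_z): +z, -z, +x, -x
probs = [0.25 * (1 + nx * r[0] + nz * r[1]) for nx, nz in directions]

base = [2.0, -2.0, 0.0, 0.0]    # one expansion of sigma_z over the POVM
null = [1.0, 1.0, -1.0, -1.0]   # coefficients of the vanishing combination

def stats(t):
    """Mean and variance of the estimate for the coefficients base + t * null."""
    f = [b + t * n for b, n in zip(base, null)]
    mean = sum(p * fi for p, fi in zip(probs, f))
    var = sum(p * fi ** 2 for p, fi in zip(probs, f)) - mean ** 2
    return mean, var

results = {t: stats(t) for t in (0.0, 1.0, -0.4)}
for t, (mean, var) in results.items():
    print(t, mean, var)
```

All three choices give the same mean, but the variances differ, which is why a criterion is needed to select one data-processing among the admissible ones.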



Notice that although the processing functions are intrinsically linear in the definition
(2), there is no guarantee that the optimal ones are linear in $X$. Remarkably, however,
the optimal processing function is indeed linear in $X$, and depends only on the POVM
and, in a Bayesian scheme, on the ensemble of possible input states. The derivation of
the optimal data-processing function requires elementary notions of frame theory [3,4]
and linear algebra [5], which will be introduced in the first part of the paper.

2. MATHEMATICAL TOOLS 

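A frame in a Hilbert space $\mathcal{K}$ is a set of vectors $\{\varphi_n\} \subseteq \mathcal{K}$
satisfying the property

$$a\,\|\psi\|^2 \leq \sum_n |\langle \varphi_n | \psi \rangle|^2 \leq b\,\|\psi\|^2 \tag{4}$$

for all $\psi \in \mathcal{K}$, with fixed $0 < a \leq b < \infty$. The starting theorem in frame
theory states that the set $\{\varphi_n\}$ is a frame iff the positive operator, called frame
operator,

$$F = \sum_n |\varphi_n\rangle\langle\varphi_n| \tag{5}$$

is bounded and invertible. In this case we can define the canonical dual frame $\{\chi_n\}$ by
the following formula

$$|\chi_n\rangle = F^{-1} |\varphi_n\rangle, \tag{6}$$

and all the vectors $\psi \in \mathcal{K}$ can be written as a linear combination of the vectors
$\{\varphi_n\}$ as follows

$$|\psi\rangle = \sum_n |\varphi_n\rangle\langle\chi_n|\psi\rangle. \tag{7}$$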
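The construction in Eqs. (5)-(7) can be checked numerically on a small example. The sketch below assumes three unit vectors in the real plane at angles of 90, 210 and 330 degrees (a choice made here for illustration, not taken from the original text). It builds the frame operator, the canonical dual, and the reconstruction of a test vector.

```python
import math

# Assumed example frame: three unit vectors in R^2, 120 degrees apart.
angles = [math.radians(a) for a in (90, 210, 330)]
frame = [(math.cos(t), math.sin(t)) for t in angles]

# Frame operator F = sum_n |phi_n><phi_n| (Eq. 5), here a real 2x2 matrix.
F = [[sum(v[i] * v[j] for v in frame) for j in range(2)] for i in range(2)]

# Explicit inverse of the 2x2 matrix F.
det = F[0][0] * F[1][1] - F[0][1] * F[1][0]
F_inv = [[F[1][1] / det, -F[0][1] / det],
         [-F[1][0] / det, F[0][0] / det]]

# Canonical dual chi_n = F^{-1} phi_n (Eq. 6).
dual = [tuple(sum(F_inv[i][j] * v[j] for j in range(2)) for i in range(2))
        for v in frame]

# Expansion of a test vector (Eq. 7): psi = sum_n phi_n <chi_n|psi>.
psi = (1.0, 2.0)
coeffs = [chi[0] * psi[0] + chi[1] * psi[1] for chi in dual]
recon = tuple(sum(c * v[i] for c, v in zip(coeffs, frame)) for i in range(2))

print(F)
print(recon)
```

For this particular frame the frame operator is proportional to the identity, so the bounds in Eq. (4) coincide.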

When the frame is made of linearly dependent vectors, the choice of the coefficients in
the expansion Eq. (7) is not unique, and all alternate choices are provided by alternate
dual frames $\{\eta_n\}$, classified by the relation [6]

$$|\eta_n\rangle = |\chi_n\rangle + |\xi_n\rangle - \sum_m |\xi_m\rangle\langle\varphi_m|\chi_n\rangle, \tag{8}$$

where $\{\xi_n\} \subseteq \mathcal{K}$ is an arbitrary set of vectors. This theorem is useful in
our case because we can consider the POVM elements as vectors in the space of Hilbert-Schmidt
operators (which for finite dimensional systems are all possible operators $X$), and they
provide a frame in the space $\mathcal{S}$. Frame theory then solves the problem of finding all
possible sets of coefficients $f_i[X]$ in Eq. (2), which are simply given by the scalar products
$(D_i|X) := \mathrm{Tr}[D_i^\dagger X]$, $\{D_i\}$ being an alternate dual for the frame
$\{P_i\} \subseteq \mathcal{S}$.
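A minimal sketch of Eq. (8) for ordinary vectors may help before moving to operators. It assumes a frame of three unit vectors in the real plane, 120 degrees apart, and an arbitrarily chosen set $\{\xi_n\}$. Both are illustrative choices not present in the original text. The alternate dual built from Eq. (8) still reconstructs any vector, with different coefficients from the canonical ones.

```python
import math

def dot(u, v):
    return u[0] * v[0] + u[1] * v[1]

# Assumed example frame in R^2; its frame operator equals (3/2) I,
# so the canonical dual is simply chi_n = (2/3) phi_n.
frame = [(math.cos(math.radians(a)), math.sin(math.radians(a)))
         for a in (90, 210, 330)]
dual = [(2 * x / 3, 2 * y / 3) for x, y in frame]

# Arbitrary vectors xi_n entering Eq. (8) (any choice is allowed).
xi = [(1.0, 0.0), (0.0, 1.0), (0.5, -0.5)]

# Alternate dual: eta_n = chi_n + xi_n - sum_m xi_m <phi_m|chi_n>.
alt = []
for n in range(3):
    overlaps = [dot(frame[m], dual[n]) for m in range(3)]
    alt.append(tuple(
        dual[n][i] + xi[n][i] - sum(xi[m][i] * overlaps[m] for m in range(3))
        for i in range(2)))

def expand(duals, psi):
    """Coefficients <dual_n|psi> and the vector rebuilt from them."""
    coeffs = [dot(d, psi) for d in duals]
    recon = tuple(sum(c * v[i] for c, v in zip(coeffs, frame)) for i in range(2))
    return coeffs, recon

psi = (1.0, 2.0)
c_can, r_can = expand(dual, psi)
c_alt, r_alt = expand(alt, psi)
print(c_can, r_can)
print(c_alt, r_alt)
```

In this example the canonical coefficients have the smaller sum of squares, which is the unweighted counterpart of the weighted minimization considered in the paper.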

In order to answer the main question of the paper, namely which is the dual frame
$\{D_i\}$ providing the minimum statistical error, we will first show that the statistical
error can be written in terms of a norm for the vector of coefficients. Indeed, if we
consider the ensemble of possible input states $\mathcal{E} = \{\rho_k, p_k\}$, we can define
the statistical error in a fixed state $\rho_k$, and use its average over the ensemble as a
figure of merit. We have

$$\delta_D(X) := \sum_{i=1}^N \mathrm{Tr}[P_i \rho_{\mathcal{E}}]\, |f_i[X]|^2 - \overline{|\langle X \rangle|^2}_{\mathcal{E}}, \tag{9}$$



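where $\rho_{\mathcal{E}} := \sum_k p_k \rho_k$, and
$\overline{|\langle X \rangle|^2}_{\mathcal{E}} := \sum_k p_k |\mathrm{Tr}[\rho_k X]|^2$. The second
term in Eq. (9) does not depend on the choice of the dual, so the minimization problem can be
stated as the minimization of the norm

$$\|f[X]\|_\pi^2 := \sum_{i=1}^N \pi_i\, |f_i[X]|^2, \tag{10}$$

where $\pi_i = \mathrm{Tr}[P_i \rho_{\mathcal{E}}]$. If we now consider the following linear map
that takes vectors of coefficients to operators

$$\Lambda : c \mapsto \sum_{i=1}^N c_i P_i, \tag{11}$$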

its matrix elements are given by $\Lambda_{mn,i} = (P_i)_{mn}$. One can easily prove that all
generalized inverses $\Gamma$ of $\Lambda$, satisfying $\Lambda\Gamma\Lambda = \Lambda$, have
matrix elements $\Gamma_{i,mn} = (D_i^*)_{mn}$, where $\{D_i\}$ is an alternate dual frame for
$\{P_i\}$. The minimum noise can be obtained through the minimum norm generalized inverse
$\Gamma$ that must satisfy the relation [7]

$$\pi \Gamma \Lambda = \Lambda^\dagger \Gamma^\dagger \pi, \tag{12}$$

where $\pi$ is the positive diagonal matrix with eigenvalues $\pi_i$.
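The two conditions on the minimum norm generalized inverse can be checked numerically. The sketch below assumes a four-outcome qubit POVM with elements $(I \pm \sigma_z)/4$ and $(I \pm \sigma_x)/4$, expressed by its real coordinates on $(I, \sigma_x, \sigma_z)$, and weights $\pi = (0.35, 0.15, 0.40, 0.10)$. These inputs are illustrative assumptions. The candidate $\Gamma = \pi^{-1}\Lambda^T(\Lambda\pi^{-1}\Lambda^T)^{-1}$ is the standard weighted least-norm right inverse from linear algebra, used here as an assumption rather than a formula quoted from the text.

```python
def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def inverse(a):
    """Invert a small square matrix by Gauss-Jordan elimination with pivoting."""
    n = len(a)
    aug = [list(row) + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [v - factor * w for v, w in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

# Columns of Lam: coordinates of P_1..P_4 on the operators (I, sigma_x, sigma_z).
Lam = [[0.25, 0.25, 0.25, 0.25],
       [0.00, 0.00, 0.25, -0.25],
       [0.25, -0.25, 0.00, 0.00]]
weights = [0.35, 0.15, 0.40, 0.10]   # assumed pi_i = Tr[P_i rho_E]

# pi^{-1} Lam^T (4x3) and the 3x3 matrix Lam pi^{-1} Lam^T.
pinv_LamT = [[Lam[k][i] / weights[i] for k in range(3)] for i in range(4)]
gram = matmul(Lam, pinv_LamT)

# Candidate minimum norm generalized inverse (4x3).
Gam = matmul(pinv_LamT, inverse(gram))

LGL = matmul(matmul(Lam, Gam), Lam)      # should reproduce Lam
GL = matmul(Gam, Lam)
piGL = [[weights[i] * GL[i][j] for j in range(4)] for i in range(4)]  # Eq. (12): symmetric

# Optimal coefficients for X = sigma_z, whose coordinates are (0, 0, 1).
f_opt = [row[2] for row in Gam]
print(f_opt)
```

With real coordinates, Eq. (12) reduces to the statement that the matrix `piGL` is symmetric.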

3. MINIMIZATION OF ERROR 

Since the minimum norm generalized inverse is unique and does not depend on the
vector, the optimal dual does not depend on $X$, and the function $f_i[X] = (D_i|X)$ is linear,
as anticipated. One can prove that the optimal dual frame $\{D_i\}$ corresponding to such
$\Gamma$ is unique and can be expressed as follows [2]

$$D_i = \Delta_i - \sum_{j=1}^N \left\{ [(I - M)\pi(I - M)]^{\ddagger}\, \pi M \right\}_{ij} \Delta_j, \tag{13}$$

where $\{\Delta_i\}$ is the canonical dual, $\ddagger$ denotes the Moore-Penrose pseudoinverse,
and $M$ is the projection matrix with elements $M_{ij} = \mathrm{Tr}[\Delta_i^\dagger P_j]$.
The minimum noise for $X$ can be expressed as

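$$\delta_D(X) = (X|\Gamma^\dagger \pi \Gamma|X) - \overline{|\langle X \rangle|^2}_{\mathcal{E}} = (X|\Big( \sum_{i=1}^N \mathrm{Tr}[P_i \rho_{\mathcal{E}}]\, |D_i)(D_i| \Big)|X) - \overline{|\langle X \rangle|^2}_{\mathcal{E}}. \tag{14}$$

On the other hand, one can prove the following identity

$$\Gamma^\dagger \pi \Gamma \Lambda \pi^{-1} \Lambda^\dagger = \Gamma^\dagger \Lambda^\dagger \Gamma^\dagger \Lambda^\dagger = \Gamma^\dagger \Lambda^\dagger = \sum_{i=1}^N |D_i)(P_i| = I_{\mathcal{S}}, \tag{15}$$

where the first equality follows from Eq. (12) and the second from $\Lambda\Gamma\Lambda = \Lambda$.
This implies that $\Gamma^\dagger \pi \Gamma = (\Lambda \pi^{-1} \Lambda^\dagger)^{-1}$, and finally,
one can express the minimum noise in terms of the POVM and the ensemble only, as follows

$$\delta_D(X) = (X|(\Lambda \pi^{-1} \Lambda^\dagger)^{-1}|X) - \overline{|\langle X \rangle|^2}_{\mathcal{E}}. \tag{16}$$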
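Eq. (16) can be evaluated directly for a small case. The sketch below assumes a four-outcome qubit POVM with elements $(I \pm \sigma_z)/4$ and $(I \pm \sigma_x)/4$, an ensemble of two equiprobable states with Bloch components $(r_x, r_z) = (0.8, 0)$ and $(0.4, 0.8)$, and $X = \sigma_z$. All of these are illustrative assumptions. Operators are represented by their real coordinates on $(I, \sigma_x, \sigma_z)$, which span $\mathcal{S}$ here. The quadratic form in Eq. (16) keeps the same value in any basis of $\mathcal{S}$, provided $\Lambda$ and $X$ are expressed in that same basis. For comparison, the noise obtained with the canonical dual (unweighted least-norm coefficients) is also computed.

```python
def inverse(a):
    """Invert a small square matrix by Gauss-Jordan elimination with pivoting."""
    n = len(a)
    aug = [list(row) + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [v - factor * w for v, w in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def expect(op, r):
    """Tr[op rho] for op = a I + b sx + c sz and rho = (I + r_x sx + r_z sz) / 2."""
    a, b, c = op
    return a + b * r[0] + c * r[1]

# Assumed inputs: POVM coordinates, ensemble (p_k, (r_x, r_z)), and X = sigma_z.
povm = [(0.25, 0.0, 0.25), (0.25, 0.0, -0.25), (0.25, 0.25, 0.0), (0.25, -0.25, 0.0)]
ensemble = [(0.5, (0.8, 0.0)), (0.5, (0.4, 0.8))]
x = (0.0, 0.0, 1.0)

weights = [sum(p * expect(P, r) for p, r in ensemble) for P in povm]   # pi_i
mean_sq = sum(p * expect(x, r) ** 2 for p, r in ensemble)              # second term of Eq. (9)

Lam = [[P[k] for P in povm] for k in range(3)]                         # 3x4 matrix

# Eq. (16): quadratic form of (Lam pi^{-1} Lam^T)^{-1} on the coordinates of X.
gram = [[sum(Lam[a][i] * Lam[b][i] / weights[i] for i in range(4))
         for b in range(3)] for a in range(3)]
gram_inv = inverse(gram)
second_moment_opt = sum(x[a] * gram_inv[a][b] * x[b]
                        for a in range(3) for b in range(3))
noise_opt = second_moment_opt - mean_sq

# Canonical dual for comparison: f = Lam^T (Lam Lam^T)^{-1} x.
plain_inv = inverse([[sum(Lam[a][i] * Lam[b][i] for i in range(4))
                      for b in range(3)] for a in range(3)])
y = [sum(plain_inv[a][b] * x[b] for b in range(3)) for a in range(3)]
f_can = [sum(Lam[a][i] * y[a] for a in range(3)) for i in range(4)]
noise_can = sum(w * fi ** 2 for w, fi in zip(weights, f_can)) - mean_sq

print(noise_opt, noise_can)
```

In this example the optimized processing gives a strictly smaller noise than the canonical dual, because the weights $\pi_i$ are not uniform.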

The optimal dual has been obtained in a completely different framework in [8] in the
particular case when $\rho_{\mathcal{E}} = I/d$ (with $d$ the dimension of the Hilbert space),
and the figure of merit considered therein is the Hilbert-Schmidt distance between the
estimated state and the true state.



4. CONCLUSION 



In this paper we reviewed the problem of estimation of ensemble averages of operators 
by indirect measurements, through a fixed measurement whose statistics is described by a 
POVM $\{P_i\}$. The coefficients for the expansion of an operator on the POVM elements
provide the processing functions, and their calculation is possible in principle by using 
elementary results in frame theory. The difficult problem is to decide which processing 
function is the best in order to minimize the statistical error in the estimation of the 
ensemble average. We restated this problem as the inversion of a rectangular matrix with 
the constraint of minimum norm. We reviewed the general solution derived in Ref. [2],
and we presented a compact formula for the evaluation of the minimum noise in terms of
the POVM elements and the input ensemble.

ACKNOWLEDGMENTS 

P. P. thanks A. J. Scott for an interesting exchange of e-mails, which inspired Eq.
(16). This work has been supported by Ministero Italiano dell'Università e della Ricerca
(MIUR) through PRIN 2005. P. P. acknowledges financial support by EC under project
SECOQC (contract n. IST-2003-506813).

REFERENCES 

[1] J. von Neumann, Mathematical Foundations of Quantum Mechanics (Princeton University
Press, Princeton, 1955).

[2] G. M. D'Ariano and P. Perinotti, Phys. Rev. Lett. 98, 020403 (2007).

[3] R. J. Duffin and A. C. Schaeffer, Trans. Am. Math. Soc. 72, 341 (1952); P. G.
Casazza, Taiw. J. Math. 4, 129 (2000).

[4] P. Casazza, D. Han, and D. R. Larson, Contemp. Math. 247, 149 (1999).

[5] R. B. Bapat, Linear Algebra and Linear Models (Springer-Verlag, New York, 2000).

[6] S. Li, Numer. Funct. Anal. Optim. 16, 1181 (1995).

[7] G. M. D'Ariano, P. Perinotti, and M. F. Sacchi, Phys. Rev. A 72, 042108 (2005).

[8] A. J. Scott, J. Phys. A 39, 13507 (2006).



